4 <110> APPLICANT: Chen, Hongyun 

5 Le Bihan, Stephana 

7 <120> TITLE OF INVENTION: NOVEL ABCG4 TRANSPORTER AND USES THEREOF 

10 <130> FILE REFERENCE: 100103.406 

12 <140> CURRENT APPLICATION NUMBER: US/10/090,455 

13 <141> CURRENT FILING DATE: 2002-03-01 

15 <160> NUMBER OF SEQ ID NOS: 17 

17 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 3455 

21 <212> TYPE: DNA 

22 <213> ORGANISM: Homo sapiens 

24 <400> SEQUENCE: 1 

25 gccaccatgg cggagaaggc gctggaggcc gtgggctgtg gactagggcc gggggctgtg 60 

26 gccatggccg tgacgctgga ggacggggcg gaaccccctg tgctgaccac gcacctgaag 120 

27 aaggtggaga accacatcac tgaagcccag cgcttctccc acctgcccaa gcgctcagcc 180 

28 gtggacatcg agttcgtgga gctgtcctat tccgtgcggg aggggccctg ctggcgcaaa 240 

29 aggggttata agacccttct caagtgcctc tcaggtaaat tctgccgccg ggagctgatt 300 

30 ggcatcatgg gcccctcagg ggctggcaag tctacattca tgaacatctt ggcaggatac 360 

31 agggagtctg gaatgaaggg gcagatcctg gttaatggaa ggccacggga gctgaggacc 420 

32 ttccgcaaga tgtcctgcta catcatgcaa gatgacatgc tgctgccgca cctcacggtg 480 

33 ttggaagcca tgatggtctc tgctaacctg aatcttactg agaatcccga tgtgaaaaac 540 

34 gatctcgtga cagagatcct gacggcactg ggcctgatgt cgtgctccca cacgaggaca 600 

35 gccctgctct ctggcgggca gaggaagcgt ctggccatcg ccctggagct ggtcaacaac 660 

36 ccgcctgtca tgttctttga tgagcccacc agtggtctgg atagcgcctc ttgtttccaa 720 

37 gtggtgtccc tcatgaagtc cctggcacag gggggccgta ccatcatctg caccatccac 780 

38 cagcccagtg ccaagctctt tgagatgttt gacaagctct acatcctgag ccagggtcag 840 

39 tgcatcttca aaggagtggt caccaacctg atcccctatc taaagggact cggcttgcat 900 

40 tgccccacct accacaaccc ggctgacttc atcatcgagg tggcctctgg cgagtatgga 960 

41 gacctgaacc ccatgttgtt cagggctgtg cagaatgggc tgtgcgctat ggctgagaag 1020 

42 aagagcagcc ctgagaagaa cgaggtccct gccccatgcc ctccttgtcc tccggaagtg 1080 

43 gatcccattg aaagccacac ctttgccacc agcaccctca cacagttctg catcctcttc 1140 

44 aagaggacct tcctgtccat cctcagggac acggtcctga cccacctacg gttcatgtcc 1200 

45 cacgtggtta ttggcgtgct catcggcctc ctctacctgc atattggcga cgatgccagc 1260 

46 aaggtcttca acaacaccgg ctgcctcttc ttctccatgc tgttcctcat gttcgccgcc 1320 

47 ctcatgccaa ctgtgctcac cttcccctta gagatggcgg tcttcatgag ggagcacctc 1380 

48 aactactggt acagcctcaa agcgtattac ctggccaaga ccatggctga cgtgcccttt 1440 

49 caggtggtgt gtccggtggt ctactgcagc attgtgtact ggatgacggg ccagcccgct 1500 

50 gagaccagcc gcttcctgct cttctcagcc ctggccaccg ccaccgcctt ggtggcccaa 1560 

51 tctttggggc tgctgatcgg agctgcttcc aactccctac aggtggccac ttttgtgggc 1620 

52 ccagttaccg ccatccctgt cctcttgttc tccggcttct ttgtcagctt caagaccatc 1680 

53 cccacttacc tgcaatggag ctcctatctc tcctatgtca ggtatggctt tgagggtgtg 1740 

54 atcctgacga tctatggcat ggagcgagga gacctgacat gtttagagga acgctgcccg 1800 






RAW SEQUENCE LISTING DATE: 03/19/2002 

PATENT APPLICATION: US/10/090,455 TIME: 15:56:14 

Input Set : A:\406.app 

Output Set: N:\CRF3\03192002\J090455.raw 

55 ttccgggagc cacagagcat cctccgagcg ctggatgtgg aggatgccaa gctctacatg 1860 

56 gacttcctgg tcttgggcat cttcttccta gccctgcggc tgctggccta ccttgtgctg 1920 

57 cgttaccggg tcaagtcaga gagatagagg cttgccccag cctgtacccc agcccctgca 1980 

58 gcaggaagcc cccagtccca gccctttggg actgttttaa ccttatagac ttgggcactg 2040 

59 gttcctggcg gggctatcct ctcctccctt ggctcctcca caggctggct gtcggactgc 2100 

60 gctcccagcc tgggctctgg gagtgggggc tccagccctc cccactatgc ccaggagtct 2160 

61 tcccaagttg atgcggtttg tagcttcctc cctactctct ccaacacctg catgcaaaga 2220 

62 ctactgggag gctgctgcct ccttcctgcc catggcaccc tcctctgctg tctgcctggg 2280 

63 agccctaggc tctctagggc cccacttaca actgaccaaa gtggccccct ctgggggtcc 2340 

64 ccaccacaca agtgtttgta aactgggctg ctataaggtt ggagttccag ggctgggccc 2400 

65 tggtggagtc cactggaagt cccattatgg atgttgaaat ggacagggaa ggactctgga 2460 

66 agtctcttcc tcctcctcct cttctctcca cccctagacc ctggctgact tggacaatct 2520 

67 gccaggacag aagctgggtt ttctgtctag gtcaccactc ccaatcctgg ggattggaga 2580 

68 ggcctggggc tgtgggatgc cccatccccc tccccatcac ctttggtggg ggcagggcct 2640 

69 ggtggcacct gtgcaataat gtctgtgttt ctctcccacc tgccactgga actggagaat 2700 

70 gcactttatt ctgggcgggg ggtgagtggg ggaagaccca accctccttt ctcgctgccc 2760 

71 ctaacgcatg cacggtctcg tgatgctccc tccctctccg gagtgacagg cacatacatg 2820 

72 agaacaggcc atctcagccc tacacacttg ccatccccta cagcacagag gaagagtgat 2880 

73 ggtggcatgc tggtggtggc gggtgctggt gggaggacag tgccaacctc ctcctgggga 2940 

74 tcccatgttg gagactctaa ggataaggct ggtgctgccc agggtgtcta caggaactgc 3000 

75 aggtgtctac ccccaagtct tccctcctcc caagccaggg gtggcacagg gcactagatc 3060 

76 cctggagttc aggaaccaac acaagcacaa ccacgggcat aagttggcct tggccactgc 3120 

77 cacccacggc cctccttttg tgctccatgc tggcatcttc actcccctac cccttcccca 3180 

78 gccactgctg ctcattcaaa cttctgtcca tgtccctcca ctgttcctat cagcaggtgg 3240 

79 cccctgggca tcagaacagc ctgccctggg caccaggtgg cagacacact cagagcatgt 3300 

80 ctggctttcc tggtgggtcc aggctcattc tgcttctgat ttcccctccc ccagggctca 3360 

81 ttttccccct ttttcctgta cacatccctg tctacctcct ctcaccctgc cacagattct 3420 

82 tcctatcaca cagggatgcc agttgtattt gtggg 3455 

84 <210> SEQ ID NO: 2 

85 <211> LENGTH: 646 

86 <212> TYPE: PRT 

87 <213> ORGANISM: Homo sapiens 

89 <400> SEQUENCE: 2 

90 Met Ala Glu Lys Ala Leu Glu Ala Val Gly Cys Gly Leu Gly Pro Gly 

91 1 5 10 15 

92 Ala Val Ala Met Ala Val Thr Leu Glu Asp Gly Ala Glu Pro Pro Val 

93 20 25 30 

94 Leu Thr Thr His Leu Lys Lys Val Glu Asn His Ile Thr Glu Ala Gln 

95 35 40 45 

96 Arg Phe Ser His Leu Pro Lys Arg Ser Ala Val Asp Ile Glu Phe Val 

97 50 55 60 

98 Glu Leu Ser Tyr Ser Val Arg Glu Gly Pro Cys Trp Arg Lys Arg Gly 

99 65 70 75 80 

100 Tyr Lys Thr Leu Leu Lys Cys Leu Ser Gly Lys Phe Cys Arg Arg Glu 

101 85 90 95 

102 Leu Ile Gly Ile Met Gly Pro Ser Gly Ala Gly Lys Ser Thr Phe Met 

103 100 105 110 

104 Asn Ile Leu Ala Gly Tyr Arg Glu Ser Gly Met Lys Gly Gln Ile Leu 

105 115 120 125 
155 515 520 525 

171 645 

174 <210> SEQ ID NO: 3 

175 <211> LENGTH: 1941 

176 <212> TYPE: DNA 

177 <213> ORGANISM: Homo sapiens 

179 <400> SEQUENCE: 3 

180 atggcggaga aggcgctgga ggccgtgggc tgtggactag ggccgggggc tgtggccatg 60 

181 gccgtgacgc tggaggacgg ggcggaaccc cctgtgctga ccacgcacct gaagaaggtg 120 

182 gagaaccaca tcactgaagc ccagcgcttc tcccacctgc ccaagcgctc agccgtggac 180 

183 atcgagttcg tggagctgtc ctattccgtg cgggaggggc cctgctggcg caaaaggggt 240 

184 tataagaccc ttctcaagtg cctctcaggt aaattctgcc gccgggagct gattggcatc 300 

185 atgggcccct caggggctgg caagtctaca ttcatgaaca tcttggcagg atacagggag 360 

186 tctggaatga aggggcagat cctggttaat ggaaggccac gggagctgag gaccttccgc 420 

187 aagatgtcct gctacatcat gcaagatgac atgctgctgc cgcacctcac ggtgttggaa 480 

188 gccatgatgg tctctgctaa cctgaatctt actgagaatc ccgatgtgaa aaacgatctc 540 

189 gtgacagaga tcctgacggc actgggcctg atgtcgtgct cccacacgag gacagccctg 600 

190 ctctctggcg ggcagaggaa gcgtctggcc atcgccctgg agctggtcaa caacccgcct 660 

191 gtcatgttct ttgatgagcc caccagtggt ctggatagcg cctcttgttt ccaagtggtg 720 

192 tccctcatga agtccctggc acaggggggc cgtaccatca tctgcaccat ccaccagccc 780 

193 agtgccaagc tctttgagat gtttgacaag ctctacatcc tgagccaggg tcagtgcatc 840 

194 ttcaaaggag tggtcaccaa cctgatcccc tatctaaagg gactcggctt gcattgcccc 900 

195 acctaccaca acccggctga cttcatcatc gaggtggcct ctggcgagta tggagacctg 960 

196 aaccccatgt tgttcagggc tgtgcagaat gggctgtgcg ctatggctga gaagaagagc 1020 

197 agccctgaga agaacgaggt ccctgcccca tgccctcctt gtcctccgga agtggatccc 1080 

198 attgaaagcc acacctttgc caccagcacc ctcacacagt tctgcatcct cttcaagagg 1140 

199 accttcctgt ccatcctcag ggacacggtc ctgacccacc tacggttcat gtcccacgtg 1200 

200 gttattggcg tgctcatcgg cctcctctac ctgcatattg gcgacgatgc cagcaaggtc 1260 

201 ttcaacaaca ccggctgcct cttcttctcc atgctgttcc tcatgttcgc cgccctcatg 1320 

202 ccaactgtgc tcaccttccc cttagagatg gcggtcttca tgagggagca cctcaactac 1380 

203 tggtacagcc tcaaagcgta ttacctggcc aagaccatgg ctgacgtgcc ctttcaggtg 1440 

204 gtgtgtccgg tggtctactg cagcattgtg tactggatga cgggccagcc cgctgagacc 1500 

205 agccgcttcc tgctcttctc agccctggcc accgccaccg ccttggtggc ccaatctttg 1560 

206 gggctgctga tcggagctgc ttccaactcc ctacaggtgg ccacttttgt gggcccagtt 1620 
207 accgccatcc ctgtcctctt gttctccggc ttctttgtca gcttcaagac catccccact 1680 

208 tacctgcaat ggagctccta tctctcctat gtcaggtatg gctttgaggg tgtgatcctg 1740 

209 acgatctatg gcatggagcg aggagacctg acatgtttag aggaacgctg cccgttccgg 1800 

210 gagccacaga gcatcctccg agcgctggat gtggaggatg ccaagctcta catggacttc 1860 

211 ctggtcttgg gcatcttctt cctagccctg cggctgctgg cctaccttgt gctgcgttac 1920 

212 cgggtcaagt cagagagata g 1941 

214 <210> SEQ ID NO: 4 

215 <211> LENGTH: 674 

216 <212> TYPE: PRT 

217 <213> ORGANISM: Homo sapiens 

219 <400> SEQUENCE: 4 

220 Met Ala Ala Phe Ser Val Gly Thr Ala Met Asn Ala Ser Ser Tyr Ser 

221 1 5 10 15 

222 Ala Glu Met Thr Glu Pro Lys Ser Val Cys Val Ser Val Asp Glu Val 

223 20 25 30 

224 Val Ser Ser Asn Met Glu Ala Thr Glu Thr Asp Leu Leu Asn Gly His 

225 35 40 45 

226 Leu Lys Lys Val Asp Asn Asn Leu Thr Glu Ala Gln Arg Phe Ser Ser 

227 50 55 60 

228 Leu Pro Arg Arg Ala Ala Val Asn Ile Glu Phe Arg Asp Leu Ser Tyr 

229 65 70 75 80 

230 Ser Val Pro Glu Gly Pro Trp Trp Arg Lys Lys Gly Tyr Lys Thr Leu 

231 85 90 95 

232 Leu Lys Gly Ile Ser Gly Lys Phe Asn Ser Gly Glu Leu Val Ala Ile 

233 100 105 110 

234 Met Gly Pro Ser Gly Ala Gly Lys Ser Thr Leu Met Asn Ile Leu Ala 

235 115 120 125 

236 Gly Tyr Arg Glu Thr Gly Met Lys Gly Ala Val Leu Ile Asn Gly Leu 

237 130 135 140 

238 Pro Arg Asp Leu Arg Cys Phe Arg Lys Val Ser Cys Tyr Ile Met Gln 

239 145 150 155 160 

240 Asp Asp Met Leu Leu Pro His Leu Thr Val Gln Glu Ala Met Met Val 

241 165 170 175 

242 Ser Ala His Leu Lys Leu Gln Glu Lys Asp Glu Gly Arg Arg Glu Met 

243 180 185 190 

244 Val Lys Glu Ile Leu Thr Ala Leu Gly Leu Leu Ser Cys Ala Asn Thr 

245 195 200 205 

246 Arg Thr Gly Ser Leu Ser Gly Gly Gln Arg Lys Arg Leu Ala Ile Ala 

247 210 215 220 

248 Leu Glu Leu Val Asn Asn Pro Pro Val Met Phe Phe Asp Glu Pro Thr 

249 225 230 235 240 

250 Ser Gly Leu Asp Ser Ala Ser Cys Phe Gln Val Val Ser Leu Met Lys 

251 245 250 255 

252 Gly Leu Ala Gln Gly Gly Arg Ser Ile Ile Cys Thr Ile His Gln Pro 

253 260 265 270 

254 Ser Ala Lys Leu Phe Glu Leu Phe Asp Gln Leu Tyr Val Leu Ser Gln 

255 275 280 285 

256 Gly Gln Cys Val Tyr Arg Gly Lys Val Cys Asn Leu Val Pro Tyr Leu 

257 290 295 300 




Use of n and/or Xaa has been detected in the 
Sequence Listing. Review the Sequence Listing 
to ensure a corresponding explanation 